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INFORMATION-EXTREME MACHINE LEARNING 
OF AN OPHTHALMIC DIAGNOSTIC SYSTEM WITH 
A HIERARCHICAL CLASS STRUCTURE 


Abstract. The paper considers the method of hierarchical information-extreme machine learning for the system 
of ophthalmic diagnosis of eye pathology. Since the proposed method is developed within the framework of a 
functional approach to modeling the cognitive processes of natural intelligence, it, unlike neuro-like structures, acquires 
the properties of flexibility when retraining a diagnostic system and requires an order of magnitude fewer image 
samples. In addition, the decision rules based on the results of machine learning within the geometric approach in the 
form of a binary hierarchical structure of recognition classes ensure their practical invariance to the multidimensionality 
of both the space of diagnostic features and the alphabet of recognition classes. The modified Kullback-Leibler 
information measure, which is considered as a function of the accuracy of classification solutions, is chosen as a 
criterion for optimizing the parameters of the machine learning system for diagnosing eye pathologies. A hierarchical 
information-extreme machine learning algorithm for an ophthalmic diagnostic system for six eye pathologies was 
developed and programmatically implemented. Based on the results of functional diagnostics, it has been 
experimentally proved that the constructed decision rules are error-free according to the training matrices of recognition 
classes of each level of the constructed binary hierarchical structure. 

Keywords: information-extreme machine learning, functional categorical model, information criterion, input 
mathematical description, training matrix, diagnostic system, eye pathology. 


Introduction effective systems for computerized 

The diagnosis of ophthalmic diseases diagnostics of eye pathologies is a complex 
plays a pivotal role in preserving vision and computational task that requires the use of 
providing timely treatment. A multitude of modern methods of machine learning and data 
examination methods are employed to mining. Despite the demonstrated efficacy of 
ascertain the correct diagnosis, encompassing artificial neural networks, probabilistic 
both non-invasive (slit lamp examination, models, and clustering methods in retinal 
visual acuity, fundus imaging, ultrasound, and image analysis, the quest for optimal 
optical coherence tomography) and invasive approaches to diagnosing ophthalmic diseases 
(e.g., fluorescein angiography) techniques [1— persists as a pressing challenge. 
3]. Non-invasive techniques, such as fundus The paper proposes a_ hierarchical 
image analysis and_ optical coherence information-extreme machine learning 
tomography, are particularly promising due to method adapted for eye image analysis. The 
their safety, ease of use, and the ability to developed approach allows diagnosing a 
store the results for further analysis and number of typical diseases, such as optic 
monitoring of the course of disease [4, 5]. nerve atrophy, glaucoma, retinal detachment, 

The development of computer optic neuritis, and retinitis pigmentosa. 
technology offers the potential for automated 
analysis of ophthalmic examination results, Problem statement 
with the aim of improving diagnostic Let's consider the problem _ of 
accuracy and reducing the workload of information synthesis of a learning system for 


doctors. However, the implementation of 
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ophthalmic diagnosis of eye pathologies from 
images. 

Let the alphabet {x°|m=1,M}, of 
recognition classes be given, each of which is 
characterized by a representative set of 
images of the object of the corresponding 
pathology. The input training matrix 
| vy? |i=1,N; 7 =1,J ||, in which the i-th column 


of the matrix is a training set of values for the 
i-th diagnostic feature and the j-th row is a 
structured vector of features of the recognition 
class x? , is formed during image processing 


by reading the values of the brightness 
gradations of the pixels of the receptor field. 
According to the concept of Information 
Extreme Intelligence (IEI) technology, the 
input training matrix is transformed into a 
working binary matrix, which is adapted 
during the machine learning process to 
achieve maximum accuracy of diagnostic 
decisions. Let's consider machine learning 
with a second level of depth. At the first level, 
the optimal (in the information — sense) 
geometric parameters of the hyperspherical 
containers of the recognition classes are 
determined, and at the second level, the 
system of control tolerances for diagnostic 
features is determined. At the same time, the 
vector of functional parameters, which 
influence the functional efficiency of a 
machine learning system for recognition of 


oO 


feature vectors, e.g. a recognition class X hm, ? 


has the form 


8), =<X pm, 4 SOg>s (1) 


h,m, 
where hf is the level of the hierarchical 
structure; m, 1s the number of classes for 
which decisive rules are formed at the h 
hierarchical level in the form of 
hyperspherical containers (x ,,,,, — center and 


d 


hyperspherical container x,,,,,); 0, — the 


radius of the corresponding 


h,my, 


system of control tolerances is formed at the h 
hierarchical level. 

Certain restrictions are imposed on the 
vector of system functioning parameters, 
which will be further referred to as machine 
learning parameters: 
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- the radius d;,,,, of the hyperspherical 


container for the recognition class X‘, He 


must be less than the distance between its 
center and the center of the nearest container 
of the same hierarchy level; 

- the range of acceptable values of the 
machine learning parameter 6, is limited by 
the inequality 6, <6, /2, where O, is a 
system of normalized tolerances. 

In the process of machine learning an 
ophthalmic diagnostic system, it is necessary 
to: 

1) optimize the parameters of vector (1) 
by maximizing the information criterion 


oe 5 
averaged over all recognition classes {X, ,,, } 


aes CE ‘! 
E, =— max E,, (d,,); (2) 
M h=1 m,=1 Gy Gy, sp fats a ve 
2)using the optimal geometric 
parameters of containers for recognition 


classes obtained in the process of machine 
learning, build decision rules for each level of 
the hierarchical structure that ensure high 
overall accuracy of diagnostic decision- 
making. 

3) at the testing stage, it is necessary to 
decide whether the implementation belongs to 
one of the recognition classes at the final level 
of the formed hierarchical structure. 

Thus, the task of the information- 
extreme synthesis of a learning-based eye 
disease diagnostic system is to optimize its 
machine learning parameters by maximizing 
the value of the global information criterion 
(2) to its theoretical maximum value. 


Analysis of recent research and 
publications 
The rapid development of artificial 


intelligence and machine learning 
technologies opens new opportunities for 
automated analysis of ophthalmic 


examination results and improved accuracy of 
eye disease diagnosis [1-3]. Ophthalmology is 
actively researching the use of modern 
machine learning and pattern recognition 
methods to diagnose and monitor multimodal 
eye diseases [4-6]. Analysis shows that the 
most commonly used methods in ophthalmic 
research are random forests and support 
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vectors [7-9]. Convolutional neural networks 
have been widely used in medical image 
recognition tasks, including eye images [10, 
11]. However, there are still unresolved issues 
to ensure high functional efficiency of known 
machine learning methods in the context of 
multidimensional space of diagnostic features 
and significant overlap in this space of 
recognition classes characterizing different 
pathologies. The problematic task is still to 
provide machine learning methods with 
flexibility while increasing the power of the 
recognition class alphabet. In addition, the 
accuracy of machine learning using known 
methods depends significantly on the power 
of the recognition class alphabet, which 
requires a transition from linear data 
structures to hierarchical ones that ensure the 
construction of error-free decision rules 
according to the training matrix. 

A promising way to overcome the 
above scientific and methodological 
difficulties is to use the ideas and methods of 
the so-called information-extreme intelligent 
technology (IET) of data analysis created by 
the scientific school of Anatolii Dovbysh, 
which is based on maximizing — the 
information capacity of the recognition 
system in the process of machine learning 


[12-14]. Information-extreme machine 
learning methods, unlike  neuro-like 
structures, are developed within the 


framework of a functional approach to 
modeling the cognitive processes of natural 
intelligence in the construction and adoption 
of classification decisions [15]. 


The purpose of the study 

The aim of this work is to improve the 
accuracy of information-extensive machine 
learning for ophthalmic diagnostic systems by 
building a hierarchical binary structure of 
diagnostic data. Achieving this goal allows us 
to move from linear multi-class machine 
learning to two-class machine learning for 
each level of the binary tree, which is a 
prerequisite for improving the accuracy of 
machine learning. 


Materials and methods of the study 
The information synthesis of a learning 
ophthalmic diagnostic system is considered 
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within the framework of IEI-technology by 
optimizing the machine learning parameters 
given by vector (1) according to the 
information criterion. The main way to 
overcome the negative impact of the 
multidimensionality of the recognition class 
alphabet on the accuracy of machine learning, 
which is determined by the full probability of 
making correct classification decisions, is to 
switch from a linear data structure to a 
hierarchical one. As a result, instead of linear 
multi-class machine learning, two-class 
machine learning is performed for the 
recognition classes of each level of the binary 
hierarchical structure. In this case, at the same 
level of depth and with the same dimension of 
the diagnostic feature space, the necessary 
conditions are created to improve the 
accuracy of machine learning. At the same 
time, unlike neural networks, the depth in 
information-extreme classifiers is determined 
by the number of machine learning 
parameters that are optimized by the 
information criterion. 

The functional categorical model of 
information-extreme machine learning based 
on a hierarchical data structure is represented 
as an oriented graph, the attributes of the 
vertices of which are the sets involved in the 
machine learning process. At the same time, 
the edges of the graph are operators that map 
the corresponding set to another. The input 
mathematical description of the ophthalmic 
diagnostic system is considered in the form of 
a structure 


Ip =<G,T,Q,Z,H, YX" 6. f, fy>, 


where G — is the set of factors that affect the 
diagnostic system; T — is the set of moments 
of information reading; © — is the space of 
diagnostic features; Z — is the alphabet of 
recognition classes; H — is the hierarchical 
data structure; Y'! — is the input (Euclidean) 
training brightness matrix of two recognition 
classes of the h-th level of the binary tree; 
X'"! _is the working binary training matrix of 
two recognition classes of the h-th level of the 
binary tree given in the Hamming space; g — 
is the operator of constructing the hierarchical 
data structure H; fi — is the operator of 
forming the input training matrix; fo — is the 
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operator of converting the matrix Y" into a 
binary training matrix X!"'. 

The functional categorical model of the 
hierarchical information-extreme machine 


u 


learning system for ophthalmic diagnosis is 
shown in Figure 1. 
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Fig. 1. Functional categorical model of hierarchical machine learning 


In Figure 1, the source of information is 
the Cartesian product of sets TxGxQxZ. 
The term set E contains the values of the 
information criterion calculated at each step 
of machine learning. In accordance with the 
principle of complete composition, this set is 
common to all cycles of optimization of 
learning parameters. At each learning step, the 
operator r: E> Rl restores the 
hyperspherical containers of the recognition 
classes in the radial basis of the binary 
Hamming feature space, forming, in general, 
their fuzzy partitioning “|. The operator 
& maps the binary vectors of the working 
matrix X'"! to the partition §!”"!, Next, the 
y:X, 01 Mn the main 
irae eX), The 
operator ’ calculates the set of precision 
characteristics 3”!, where g, =1;7. At each 


operator tests 


Statistical hypothesis 


step of machine learning, the operator 


calculates the information criterion of 
optimization, which is a function of the 
precision characteristics. The cycle of 


optimization of control tolerances is closed 


through the set Dp", whose elements are the 
values of control tolerances for diagnostic 


features. The operator uw, regulates the 
machine learning process. 
Thus, the proposed functional 


categorical model of information-extreme 
machine learning optimizes the machine 
learning parameters of the ophthalmic 
diagnostic system for each level of the 
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hierarchical structure specified by vector (1). 
At the same time, the machine learning 
process automatically generates a_ binary 
hierarchical data structure. 

The process of information-extreme 
machine learning based on a hierarchical data 
structure is performed in the following main 
stages: 

1) based on the input training matrix for 
a given alphabet of recognition classes, the 
average structured vectors {y,, |m, =1,M,} 


of diagnostic features are calculated; 

2) for each level of the binary tree, the 
parameter 5, of the field of control tolerances 
for diagnostic features is set. For each i-th 


feature of the vector y, , the lower A 


m, ? HKmy ,i 


and upper Aj,,,, ; control tolerances for 


diagnostic features are calculated by the 

formulas 
HKm,, .i = ine i —6, > Aszkm, J a Y ii al F 6, > 

3) a set {xm} of binary averaged vectors 


of diagnostic features is formed according to 
the following rule 


_ 1, if AuKm, i < Yim, i . ABR, i° 


m,i 


0, else; 


4) at the first (highest) level (h=1) of the 
hierarchical structure, the training matrices of 
recognition classes characterized by the 
highest maximum value of the average 
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information criterion {FEij m1}, where m, = 
1..M, are selected as vertex attributes. 

5) for each subsequent (lower) level, the 
vertex attributes are selected similarly, but 
from the training matrices of recognition 
classes that are not used on the higher levels 
of the binary tree. 

6) the process of building the tree 
continues until all levels of the binary tree are 
formed. 

Thus, the above scheme for each level 
creates the necessary conditions for the 
construction of highly accurate decision rules 
by means of information-extreme two-class 
machine learning using a linear algorithm. 

According to the functional categorical 
model (Fig. 1), the algorithm for information- 
extreme machine learning of the system for 
diagnosing eye diseases based on_ the 
hierarchical data structure can be represented 
for the h-th level of the binary tree as an 
iterative procedure for finding the global 
maximum of the criterion (2) averaged by 
alphabetical x7, } recognition classes: 


(3) 


Ors =argmax{ max En(d,)}. 
Gs. GeOGa, 


Unlike a linear algorithm, where the 
optimal value of the parameter 6, is 
determined for the entire set of recognition 
classes, in information-extreme machine 
learning based on a_ hierarchical data 
structure, the parameter 6, is determined 
separately for each level of the hierarchy. 

The internal loop of procedure (3) 
implements a basic algorithm that performs 
the following functions at each step of 
machine learning: 

- calculates the value of criterion (2); 


- finds the global maximum of this 
criterion; 
- determines the optimal geometric 


parameters of hyperspherical containers for 
recognition classes. 

Input data of the basic algorithm: an 
array of realizations 


{y\) |m, =1.M,3i=LN, j=Ln}, a system 


ey 


of control tolerances {6,,) for diagnostic 


1 Pr, ol } of 
averaged feature 


features, selection levels 


coordinates of binary 
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vectors, which are equal to p,,; = 0.5 by 
default. 
The geometric parameters of the 


containers of the recognition classes are 
optimized using the basic machine learning 
algorithm according to the procedure [13]: 

1) formation of the input structured 
training matrix of recognition classes of the h- 
th level of the binary tree; 

2) determination of the averaged vector 
realizations of recognition classes; 

3) formation of a binary training matrix 
according to a given system of control 
tolerances for diagnostic features; 

4) determination of the averaged binary 
feature vectors by statistical averaging of the 
corresponding binary training samples; 

5) calculation of center-to-center 
distances in a given recognition class alphabet 
as code distances between the averaged 
feature vectors of recognition classes. 

6) calculation of the average 
information criterion for optimizing machine 
learning parameters at each training iteration; 

7) searching for the global maximum of 
the averaged information criterion in the 
working area of determining the criterion 
function; 

8) determining the optimal radii of the 
containers of recognition classes by restoring 
them in the radial basis of the diagnostic 
feature space using an iterative procedure at 
each step of machine learning 


(4) 


! = —— 
dim, =arg max En(d,,,,,),m=1M,, 
BOG, 


9) STOP. 

Procedure (3) of forming a_ binary 
training matrix according to the system of 
control tolerances for diagnostic features is 
implemented in an outer loop with the 
operator of changing the parameter 6, of the 


tolerance field until the maximum value of the 
information criterion for optimizing machine 
learning parameters is reached. 

The modified Kullback-Leibler measure 
was used as a criterion for optimizing the 
machine learning parameters of the 
ophthalmic diagnostic system for each level 
of the hierarchical structure of the input data, 
which for  two-probability _ alternative 
hypotheses has the form [13, 16, 17]: 


ISSN_ 2710 - 1673 Artificial Intelligence 2024 Ne 3 


the process of implementing the machine 


1 
(kK) _ F959 _ py’) (k) 
Em = > (2 -[@h mA) + Brim(DI1} x learning algorithm. 


2 Ta (a) + B® (ad) 4107? Based on the optimal geometric 
xlog, sat) nid) as nh) = ee parameters of the hyperspherical containers of 
Bi m(A) + Brim(@)]+10 the recognition classes obtained in the process 


of machine learning, constructed the decision 


Due to the limited size of the training rules in the form of implicit relations [18, 19]: 


sample, estimates of the accuracy 
characteristics were used in the calculation of 
the information optimization criterion (5) in 


(XE, ERM NV ERM) LF [C1 > 0) & 


& = max m, =1,2}] then x € X? 
UE ta {Ln | h }] h,m,, (6) 


elsex? ¢X? }. 


h,my, 


In expression (6), the function up, of vector «‘” membership in the hyperspherical container 
of the recognition class X;;,,, 1s defined by the formula 


Thus, the vector of diagnostic features 
x” 1s classified as belonging to the class 
from the given alphabet of the corresponding 
hierarchy level for which the membership 
function (6) takes on a positive maximum 
value. In addition, the decisive rules (6), 
constructed within the framework of the 
geometric approach, allow for real-time 
diagnostic decisions, which is important for 


functional diagnostics. d 
Experiments Fig. 2. Images of eye pathologies: a — class X ie 
Let's consider information-extreme _ ; = 
machine learning for an ophthalmic b—class X3;c-class X3;d—class X4; 


diagnostic system. As input, we used images 


: : : : e—class X§ ;f—class X@ 
of six typical eye pathologies representing the 


corresponding recognition classes. These Below are the recognition classes that 
classes are ordered according to the proposed form the specified recognition class alphabet: 
methodology for forming a variation series, 1)normal condition of the eye 


the visualization of which is shown in Fig. 2. 
These images served as a test set for 
validating the developed machine learning - 
approach in the context of the task of (recognition class X>); 

computerized diagnosis of eye diseases. 3) retinal detachment (recognition class 


X3); 


(recognition class X,’); 
2) atrophy of the optic nerve 
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4) glaucoma (recognition class X{ ); 


5) optic neuritis (recognition class X?); 


6) retinitis pigmentosa (recognition 
class X¢); 
The input training matrix for the 


ophthalmic diagnosis system was formed by 
reading the pixel brightnesses of fragments of 
eye pathology images with 100x100 pixel 
size The value of each feature in the input 
matrix corresponded to the brightness of the 
corresponding image pixel in the range from 0 
to 255. Given the assumption that the images 
are stationary in brightness, they were 
scanned in the Cartesian coordinate system. In 
addition, its transposed matrix was added to 
the input training matrix, which doubled the 
dimension of the diagnostic feature space. 


E 
2,5 


2,0 
1,5 
1,0 
0,5 


0,0 
0 5 10 15 


Fig. 3. Graph of dependence of the information 


In Figure 3 and further in the text, 
double hatching indicates the working areas 
for determining the criterion function (5), in 
which the first confidence exceeds 0.5 and the 
second kind of error is less than 0.5. The 
analysis of Figure 3 shows that the optimal 
value of the parameter of the control tolerance 


field is equal 6,,=28 (in brightness 


This approach increased the amount of 
information and, in accordance with the 
maximum-distance principle of pattern 
recognition theory, created the necessary 
conditions for increasing the average 
interclass code distance. 


Analysis of the results 

For the initial alphabet of recognition 
classes, a linear information-extreme machine 
learning algorithm with parallel optimization 
of control tolerances for diagnostic features 
was implemented. Figure 3 shows a graph of 
the dependence of the information criterion 
(5) on the parameter of the control tolerance 
field obtained in the process of machine 
learning. 


20 25 30 35 


criterion on the parameter of the control tolerance field 


gradations) to the maximum value of the 


information criterion 1,1 =1,47. 


To build the decision rules (6), it is 
necessary to determine the optimal geometric 
parameters of the hyperspherical containers of 
the recognition classes. Figure 4 shows the 
graphs of the dependence of the information 
criterion (5) on the radii of the containers of 
the recognition classes. 
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a b 
E, E, 
2,5 2,5 
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0 20 40 d. 40 d, 
e 


Fig. 4. Graph of dependence of the information criterion 


on container radii: a — recognition class x’; b — recognition 


class X33 c—recognition class X?; d—recognition class X?; e— recognition class X?; f— recognition class xX? 


The results of the functional testing of 
the ophthalmic diagnosis system based on the 
developed decision rules showed that the full 
probability of correct classification is 0.83. To 
improve the accuracy, an information-extreme 


tolerance system, in which the control 
tolerances obtained from the results of parallel 
optimization were taken as the starting ones. 
Figure 5 shows a graph of changes in the 
average information criterion in the process of 


machine learning algorithm was implemented parallel-serial optimization of control 
for sequential optimization of the control tolerances. 
E 
45 
4,0 
3,5 
3,0 
2,5 LES 
2,0 Sen 
- eter 
1,0 erates SR ne 6 
seteteents SS 2 
eters . 
O° Bisescseninan Boeess 
0.0 Setatatatat tatatat 5 t 
0 50 100 150 200 250 300 350 400 450 500 550 600 


Fig. 5. Graph of changes in the information criterion in the process of sequential 
optimization of control tolerances 
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An analysis of Figure 5 shows that the 
maximum value of the average information 
criterion was achieved at the 300th iteration 


and is 2.61, which exceeds its value obtained 
in parallel optimization. Table 1 shows the 


results of machine learning. 


Table 1. Machine learning results for the first level of the hierarchy 


* Precision 
ESS En, a, characteristics 
recognition D; B" 
xe | *u] 430] 28 1 0 
xe | ~2}261] 22 | 086 0 
X oO 
x3 '3 | 4,39 30 1 0 
o | X4 
Xx; “| 0,71 10 0,51 0,01 
o | Xs 
Xs | 157 10 0,72 0,01 
o xy 
X6 16 | 3,72 23 0,96 0 


An analysis of Table 1 shows that the 
decision rules provide a full probability of 
correct classification of 0.92. 

In order to build error-free decision 
rules based on the training matrix, a binary 
hierarchical structure was constructed. The 
attributes of the vertices of the first level of 
the structure are the training matrices of the 
recognition classes x? and x’, for which, 


according to Table 1, error-free decision rules 


E 
4,5 


4,0 
3,5 
3,0 
2,5 
2,0 
1,5 
1,0 


0,5 


are constructed. The remaining recognition 
classes were transferred to the second level of 
the hierarchy. At this level, we implemented 
learning with  parallel- 
control 
tolerance system. Figure 6 for this case shows 
the average 


linear machine 


sequential optimization of the 
the graph of changes in 
information criterion 
parallel-serial 


tolerances. 


in the process 
optimization of 


0 50 100 150 200 250 300 350 400 450 500 550 600 650 700 750 800 


Fig. 6. Graph of changes in the information criterion in the process of sequential optimization of control 
tolerances of the second level of the hierarchical structure 
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According to the graph in Figure 6, for Table 2 shows the results of the parallel- 
the second level of the hierarchical structure, sequential optimization of machine learning 
the maximum value of the average parameters for the second level of the 
information optimization criterion of hierarchical structure of recognition classes. 


E = 3.80 was achieved at the 450th iteration. 


Table 2. Machine learning results for the second level of the hierarchy 


7 Precision 
Class E- dn, characteristics 

recognition : D 8° 

x? x, 4,21 | 19 0,99 0 

xy X30 4,39 | 21 1 0 

XS X33 4,39 | 17 1 0 

X6 X34 2,20 | 19 0,81 0 
Thus, at the second level of the classes X53 and X¢ are moved to the third 
hierarchical structure, the recognition classes level of the hierarchy. Figure 7 shows a graph 
Xj; and X, for which error-free decision of the dependence of the average information 


criterion (5) obtained by parallel optimization 


rules have been formed, remain. Recognition 
of control tolerances. 


E 


0 5 10 15 20 25 30 35 40 45 50 55 60 


Fig. 7. Graph of dependence of the information criterion on the parameter of the control tolerance field of the 
third level of the hierarchy 


The analysis of Fig. 7 shows that for the the results of parallel-sequential optimization 
third level of the hierarchical structure, the of machine learning parameters for 
average information criterion reaches a recognition classes at all levels of the 
maximum threshold value of £ =4.39 with hierarchical structure of recognition classes. 


the optimal parameter 5 =48. Table 3 shows 
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Table 3. Machine learning results from all levels of the hierarchy 


Cl Precision 
a Level of E. a characteristics 
recognition hierarchy " : D B 
x; xy , 4,39 28 1 0 
Xx; Xi 4,39 30 1 0 
Xt Xs I 4,39 21 1 0 
Xs X39 4,39 17 1 0 
xy X31 4,39 3 1 0 
> Il 
Xe X35 4,39 10 1 0 


The analysis of Table 3 shows that the 
constructed classifiers are error-free according 
to the training matrices of the levels of the 
binary hierarchical structure (Fig. 8). 

The obtained results of applying 
information-extreme machine learning to 
hierarchically organized data open up new 
opportunities to overcome the problem of the 
multidimensional alphabet of recognition 
classes. On the example of developing a 
system for ophthalmic diagnosis of eye 
diseases based on pathology images, the 
possibility of automatic retraining of the 
system when expanding the alphabet of 


recognition classes describing the 
corresponding visual _— pathologies __is 
demonstrated. 


NE) 
XN KS) 
&) &) 


Fig. 8. Three-level hierarchical data structure 


It is known that the use of linear 
machine learning algorithms with a large 
alphabet capacity leads to a_ significant 
decrease in its accuracy due to the high degree 
of overlap between recognition classes in the 
space of diagnostic features. The proposed 
method, unlike linear machine learning, 
determines the optimal control tolerances for 
each level of the hierarchical structure of 
classes separately. At the same time, we 
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demonstrate the feasibility of using parallel- 
sequential optimization of control tolerances 
in machine learning. This leads to an increase 
in both recognition accuracy and machine 
learning efficiency, since during sequential 
optimization, the search for the global 
maximum of the information criterion is 
carried out in the working areas defined 
during parallel optimization. 


Conclusions 

The paper proposes a new approach to 
pattern recognition with a large number of 
classes based on a_ functional-categorical 
model and information-extreme machine 
learning. The approach uses a_ hierarchical 
data structure, which allows at each level to 
determine the optimal knowledge of 
functional parameters in the information sense 
and to form error-free decision rules for a part 
of the recognition classes according to the 
training matrix. Machine learning of the 
system for recognizing eye pathologies from 
images is carried out at each level of the 
hierarchy using a linear algorithm of 
sufficient depth, which ensures high 
recognition accuracy and efficiency of the 
proposed approach. 

It is important to note that the obtained 
rules are error-free according to the training 
matrices, thus achieving a sufficient depth of 
machine learning and optimizing other system 
parameters, including the parameters of 
forming the input information description, 
will not lead to an increase in the system's 
functional efficiency. This approach has 
significant potential for use in various pattern 
recognition tasks with a large number of 
classes. 
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